CLEF-IP 2010: Retrieval Experiments in the Intellectual Property Domain
نویسنده
چکیده
In the recent decade that research in IR methods for Intellectual Property domain has increased. The rst e orts in observing how information retrieval is done in patent domain were done with the series of Nist workshops (see for example [2]). Lately, more workshops and conferences are dedicated to bringing together IR and IP specialists [3,8]. In 2008, the Irf obtained the agreement to coordinate two evaluation campaigns with emphasis on patent documents and prior art retrieval: Clef Ip and Trec Chem. The Clef Ip track was launched in 2009 to investigate IR techniques for patent retrieval and it was part of the CLEF 2009 evaluation campaign. In 2010, the track continued as a benchmarking activity of the Clef 2010 conference. The track utilizes a collection of more than 1.3 million patent documents derived from Epo (European Patent O ce) sources. The collection covers English, French and German with at least 150,000 documents in each language. There were two tasks in the 2010's track. The rst one is to nd patent documents that are candidates to constitute prior art for a given document. The second task is to classify a given document according to the International Patent Classi cation system (Ipc). Relevance judgements will be produced using the patent citations for the Prior Art Candidates search task and using the recorded classi cation codes for the Classi cation task. This notebook gives a report on the Clef Ip activity in 2010. The paper is structured as follows: Section 2 describes the test collection used this year, section 3 presents the participating teams and gives an overview of the methods the teams involved. In the same section we also present the main measurements done in this track.
منابع مشابه
CLEF-IP 2012: Retrieval Experiments in the Intellectual Property Domain
The Clef-Ip test collection was rst made available in 2009 to support research in IR methods in the intellectual property domain. Since then several kinds of tasks, re ecting various speci c parts of patent expert's work ows, have been organized. We give here an overview of the tasks, topics, assessments and evaluations of the Clef-Ip 2012 lab.
متن کاملCLEF-IP 2009: Retrieval Experiments in the Intellectual Property Domain
The Clef Ip track ran for the rst time within Clef 2009. The purpose of the track was twofold: to encourage and facilitate research in the area of patent retrieval by providing a large clean data set for experimentation; to create a large test collection of patents in the three main European languages for the evaluation of cross lingual information access. The track focused on the task of prior...
متن کاملReport on the CLEF-IP 2012 Experiments: Exploring Passage Retrieval with the PIPExtractor
This technical report presents the work carried out for the Patent Passage Retrieval track of CLEF-IP 2012. Our aim was to create IR-Platform independent module for the Passage Retrieval process. For the Document retrieval Method a Language Model based on IPC classes and for the Passage Retrieval a Passage Intellectual property Extractor (PIPExtractor) was implemented. Topics with the main lang...
متن کاملCLEF-IP 2011: Retrieval in the Intellectual Property Domain
The patent system is designed to encourage disclosure of new technologies and novel ideas by granting exclusive rights on the use of inventions to their inventors, for a limited period of time. Before a patent can be granted, patent o ces around the world perform thorough searches to ensure that no previous similar disclosures were made. In the intellectual property terminology, such kind of se...
متن کاملPatent Retrieval Experiments in the Context of the CLEF IP Track 2009
At CLEF 2009 the University of Hildesheim focused on the main task of the Intellectual Property Track which aims at finding prior art for a specified patent [cf. Information Retrieval Facility 2009]. The experiments of the University of Hildesheim concentrated on a baseline approach including stopword elimination, stemming and simple term queries. Furthermore only title and claim were included ...
متن کامل